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1.  Introduction 

1.1.  The  usual  applications  of  the  method  of  Lagrangian  multipliers,  used  in  locating 
constrained  extrema  (say  maxima),  involve  the  setting  up  of  the  Lagrangian  expression , 

(1)  4>(x>  y)  =  /(*)  +  /g(x)  i 

where  f(x)  is  being  (say)  maximized  with  respect  to  the  (vector)  variable  x  =  [xi,  •  •  •  , 

} ,  subject  to  the  constraint  g(x)  —  0,  where  g(x)  maps  the  points  of  the  ^-dimensional 
z-space  into  an  Af -dimensional  space,  and  y  =  {yi,  ‘  ‘  ,  Jm)  is  the  Lagrange  multiplier 
(vector).  Here,  {  }  indicates  a  column  vector;  the  prime  indicates  transposition,  so 

that  y'  is  a  row  vector. 

The  essential  step  of  the  customary  procedure^  the  solution  for  x,  as  well  as  y,  of  the 
pair  of  (vector)  equations,  ' 

(2)  <f>x(x,  y)  =  0,  g(x)  =  0 , 

where  <f>x(x,  y)  =  [d<f>(x,  y)/dxh  •  •  •  ,  d<f>(x,  y)/dxN\.  Let  (x,  y)  be  the  solutions  of 
equations  (2),  while  £  maximizes  f(x)  subject  to  g(x)  =  0.  Then,  under  suitable  restric¬ 
tions, 

(3)  x  =  £ . 

1.2.  In  [1]  Kuhn  and  Tucker  treat  the  related  problem  of  maximizing  f(x)  subject  to 
the  constraints1  g(x)  0,  x  0,  where,  for  an  arbitrary  K-dimensional  vector  a  = 
{ah  •  •  ■  ,  aK),  the  relation  a  ^  0  is  here  defined  to  mean  a*  0  for  k  —  1,  •  •  •  ,  K. 
Another  definition  of  vectorial  inequalities,  permitting  greater  generality  of  treatment, 
will  be  used  in  later  sections  of  this  paper.  There  we  shall  treat  directly  the  class  of  situa¬ 
tions  where  f(x)  is  to  be  maximized  subject  to  gw(x)  ^  0,  g(2)(x)  =  0,  s111  ^  0,  xl2]  not 
restricted  as  to  sign,  *  =  |*w,  x[2] } . 

Denote  by  C„  the  set  of  all  x  satisfying  the  constraints  g(x)  ^  0,  x  ^  0.  The  two  re¬ 
sults  stated  below  are  of  fundamental  importance  for  the  problem  considered. 

(A)  (See  theorem  1  [1].)  Let  g  satisfy  the  following  condition  (called  Constraint 

Most  of  the  work  of  this  paper  was  done  under  the  auspices  of  The  RAND  Corporation,  with  addi¬ 
tional  support  and  assistance  from  the  Cowles  Commission  for  Research  in  Economics  and  the  Office 
of  Naval  Research. 

1  In  [lj  our  /  and  g  are  respectively  written  as  g  and  F.  The  symbol  in  [1]  for  the  Lagrange  multiplier 
(our  y)  is  «. 
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Qualification,  here  abbreviated  as  C.Q.).2  If  x  is  a  boundary  point  of  C„  and  x  satisfies 
the  relations, 

(4)  gax(x  -  x)  ^  0  , 

(5)  xb  -  x^^O, 

where  “  -  ”  over  a  symbol  denotes  its  evaluation  at  x  —  x,  g  =  {g°,  g0j ,  ga  =  0,  g0  >  0, 
x  =  {x*,  £*},  >  0,  and  P  =  0,  then  there  exists  a  differentiable  vector- valued  func¬ 

tion  \p  of  the  real  variable  0  whose  domain  is  the  closed  interval  (0,  1)  and  the  range  is 
in  C0;  that  is,  x  =  p(6),  such  that  ^(0)  =  x  and  ^'(0)  =  \(x  —  x)  for  some  positive 
scalar  X. 

Under  this  condition,  if  all  derivatives  used  below  exist  and  if  x  maximizes  f(x)  for 
x  6  C0,  there  exists  y  satisfying  the  conditions 


(6) 

9 

x  ^  0,  <f>x  ^  0, 

x'(j>x  =  0 , 

(7) 

3^  =  0,  ^^0, 

y'4>v  =  o , 

where  <j>x  and  4>v  are  partial  (vector)  derivatives  of  the  Lagrangian  expression  (1)  evalu¬ 
ated  at  ( x ,  y). 

(B).  (See  theorem  3  [1].)  If  the  hypotheses  specified  in  (A)  hold  and,  in  addition,  the 
functions /(z),  gm(x),  m  —  1,  •  •  •  ,  M  are  concave ,8  there  exists  a  pair  (x,  y),  satisfying 
conditions  (6)  and  (7),  such  that  (x,  y)  is  a  nonnegative  saddle-point  (NNSP)  of  <f>(x,  y), 
that  is, 

(8)  <f>(x,  y)  ^  <t>(x,  y)  ^  <f>(x,  y)  for  all  x  ^  0,  y  ^  0  ; 

furthermore,  any  NNSP  (x,  y)  of  <f>(x,  y)  has  the  property  that  x  maximizes  f(x)  in  C0. 
According  to  lemma  1  [1],  conditions  (6),  (7)  are  implied  by  (8)  regardless  of  the  nature 
of  <j)(x,  y),  that  is,  even  if  <j>(x,  y)  is  not  given  by  (1). 

2.  A  modified  Lagrangian  approach 

2.1.  Because  of  the  interesting  game  theoretical  and  economic  implications  of  the 
theorem  in  (B),  section  1.2  (which  the  authors  will  study  elsewhere),  the  question  arises 
as  to  the  possibility  of  similar  results  when  some  of  the  conditions  of  the  theorem  are 
relaxed. 

It  turns  out  that  results  of  such  nature  can  be  obtained,  though  not  without  some 
sacrifices.  The  relaxation  is  primarily  with  regard  to  the  convexity  assumptions  which 
fail  to  hold  in  some  important  economic  applications  (the  case  of  “increasing  returns”). 
The  main  sacrifices  are  (1)  the  Lagrangian  expression  is  modified,  and  (2)  the  results 
are  proved  only  locally. 

The  results  are  presented  below  in  the  form  of  three  theorems.  Theorem  1  is  auxiliary 
in  nature;  theorems  2  and  3  together  imply  the  existence  of  a  local  nonnegative  saddle- 

2  This  restriction  “is  designed  to  rule  out  singularities  on  the  boundary  of  the  constraint  set,  such  as 
an  outward-pointing  ‘cusp’  ”  (see  p.  483  in  [1]).  It  should  be  noted,  however,  that  because  of  (4),  C.Q.  is 
a  property  of  g,  not  merely  of  C„.  Thus  g(x)  =  —(x—  1)*,  x  one-dimensional,  lacks  C.Q.,  while  g(x)  = 
—  (x—  1),  with  the  same  C„,  does  have  it. 

s  A  function  f(x)  is  said  to  be  concave  if 

(1  -  0)/(*°)  +  »/(*)  ^  /Id  “  0)*°  +  ex] 

for  all  0  ^  0  ^  1  and  all  x°  and  *  in  the  region  where  fix)  is  defined  (see  [1],  pp.  10-11). 
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point  for  the  modified  Lagrangian  expression.  Theorem  3  shows  this  saddle-point  to  be 
of  the  type  leading  to  convergence  in  gradient  procedures  described  by  the  authors  in  [3]. 

The  notation  differs  in  some  detail  from  that  introduced  in  section  1.  To  facilitate 
reading,  some  notational  principles  are  stated  in  2.2.1;  the  main  symbols  used  are  listed 
in  sections  2.2.2  and  2.3.4. 

2.2.1.  Some  principles  of  notation.  A  A'-dimensional  column  vector  {ah  a2)  •  ■  ■  ,  aK) 
is  denoted  by  a;  dim  a  denotes  the  number  of  components  in  a.  If  A  is  a  matrix,  A'  is  its 

K 

transpose.  Hence,  in  particular,  a'  is  a  row  vector  and  a'  b  =  akbk  is  the  inner  prod- 

A“1 

uct  of  the  vectors  a  and  b;  a-b  is  an  alternative,  and  sometimes  more  convenient,  nota¬ 
tion  for  a'b. 

[ah  a2,  •  •  •  ,  ax]  is  the  finite  (unordered)  set  whose  elements  are  aly  a2,  •  •  •  ,  aK. 
A  ~  B  is  the  set  of  all  elements  in  A  but  not  in  B  (the  set-theoretic  difference). 

{x  \px\  denotes  the  set  of  all  x  possessing  the  property  px. 

If 


(9) 

(10) 

then 

(ID 


c(a)  =  {cx{a)y  c2(a),  *  ,  cP(a )}  , 

a  =  {ai,  a2,  •  •  •  ,  an)  , 


ca  =  ca  (a) 


d  cP 
dak 


p  =  1,  2,  ■  ,P  ;  k=l,2,  -,K. 


F  urther,  c,  ca  denote,  respectively,  c(a)  and  ca(q)  =  ca  evaluated  at  a  =  a. 

If  p(a,  b)  is  a  real-valued  (scalar)  function  of  the  vectors  a  —  {ai,  a2,  •  •  •  ,  aK), 
b  =  {6i,  62,  •  •  •  ,  bR },  then 


(12) 


tab  = 


d2t 

dakdbr  ’ 


k  —  l,  2,  ■  ■  ■  ,  K  ;  r  =  1,2  •••,!?, 


where  tab  denotes  tab  evaluated  at  (a,  b ). 

Sp(x°)  =  {x\d(x,  ^°)  tk  p}  where  d(x',  x")  denotes  the  Euclidean  distance  between 
x'  and  x". 

2.2.2.  Some  symbols  used. 


(N.1.1) 


x  =  {oji,  x2,  •  ‘  •  ,  Sat}  . 


X  is  the  Euclidean  iV-space  of  the  z’s  . 

^  =  [1,  2,  •  •  •  ,  N] . 


ft'  is  a  fixed  (possibly  empty,  not  necessarily  proper)  subset  of  ft.  As  will  be  seen  in 
(N.1.4),  the  elements  of  ft'  are  the  indices  of  the  components  of  s111  as  defined  in  the 
first  paragraph  of  section  1.2. 

(N.1.2)  z  =  {zi,  z2,  •  •  •  ,  ZAf}  . 

•  Z  is  the  Euclidean  M-space  of  the  z’s  . 

[1,  2,  •  •  •  ,  M] . 

yf/'  is  a  fixed  (possibly  empty,  not  necessarily  proper)  subset  of  ftf.  As  will  be  seen  from 
(N.1.4) ,  (N.2),  (N.3),  the  elements  of  yf/'  are  the  indices  of  the  components  of  g(1)  as 
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defined  in  the  first  paragraph  of  section  1.2;  the  elements  of  Af~  Aff  are  the  indices 
of  g®  (see  same  paragraph);  g  will  be  defined  as  {gW,  g<2) } . 

(N-.1.3)  y  =  {yi,yt,  •  •  •  ,yM)  * 

F  is  the  Euclidean  Jf -space  of  the  y’s.  Here  Y  is  the  space  of  the  real-valued  linear 
functions  on  Z.  Even  in  the  Euclidean  case  it  is  convenient  to  distinguish  between  the 
two,  since  our  definitions  of  nonnegativity  in  the  two  spaces  differ. 


(N.1.4) 


x  ^  0  means 
X+  is  the  sel 
z^O  means 

y  ^  0  means 


xn  ^  0  for  n£  J\f' . 
xn  unrestricted  as  to  sign  for  n  £  /\f' . 
X+  is  the  set  of  all  x  ^  0  . 

zm  =  0  for  m  £  M' * 
zm  =  0  for  m  £  M'  • 

^  o  for  tn€Aff- 

y  m  unrestricted  as  to  sign  for  m  £  yf/' . 
For  any  vector  a  =  {ai,  aj,  *  •  ■  ,  a*}  , 


a  =  0  means  fli  =  0,  a2  =  0,  *  •  •  ,  ax  =  0  ; 


a  >  0  means  a\  >  0,  a*  >  0,  •  •  •  ,  aK  >  0  ; 
a  <  0  means  —a  >  0  . 

(N.2.1)  'g  is  a  function  on  X+  to  Z.  Hence  'g(x)  =  {'gi(x),  'g2(x),  • 
the  'gm,  m  €  yf/are  real-valued  functions. 

(N.2.2)  We  shall  find  it  convenient  to  work  with  some  of  the  'gm,  m 
their  negatives.  More  precisely,  we  write 


r  'gm  if  m£At~Af~ 

l  -  'gm  if  m€AT, 


•  •  ,  'gifix)}  where 
£  Jtf'  replaced  by 


where  AfQAf  AT  will  be  defined  in  section  2.3.4. 

8  —  {&i>  gz> '  *  '  >  8m)  . 

Note.  Since  AfQAf-Af',  it  is  seen  that  the  conditions 

'*(*)  ^  0,  g(x)  ^  0 

are  equivalent.  For  practical  purposes,  one  could  consider  the  problem  as  given  directly 
in  terms  of  g,  rather  than  'g.  We  start  with  'g,  however,  in  order  to  avoid  the  impression 
of  a  loss  of  generality  in  connection  with  the  assumptions  of  section  2.3.4. 

(N.3)  CB  =  {*| 'g(x)  ^  0,  x  ^  0}  a  |x|g(x)  ^  0,  x  ^  0} 

(the  “constraint  set”). 

(N.4)  /  is  a  real-valued  function  on  X+  (the  “maximand”) . 

(N.5)  O/g  —  [xf  \x'  £  Cg  and  f(x)  |  /(pc')  for  all  x  £  Cg\ 

(the  “optimal  set”). 
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(N.6)  x  =  xW}  where 

=  the  set  of  indices  of  the  components  of  x^\  i—  1,2 

n  €  At^  if  n  C  At'  or  n  £  At'  and  x n  >  0 

n  6  At®  if  n  £  At'  and  =  0 

for  a  given  x  £  O/0  and  either  component  may  be  empty. 

Note  1.  When  a  vector  a  is  partitioned  into  two  sub  vectors,  say 

a  =  [a*,  a,**) 

and  we  say  that  a *  (or  a**)  is  empty,  this  means  that  a  =  a**  (or  a  —  a*). 

Note  2.  The  above  partitioning  of  the  vector  x  obviously  depends  on  the  point  x  in 
Of0  chosen.  The  same  is  true  of  the  partitioning  in  (N.7)  below  and  of  various  subsequent 
partitionings  of  x  and  g.  It  is  understood  that  all  these  partitionings  refer  to  the  same 
choice  of  x ,  and  that  x,  once  chosen,  remains  fixed. 

(N.7)  g=  [gmtgm } 

where 

gI1](*)  =  0,  gW(x)  >  0 

and  either  component  may  be  empty. 

(N.8)  h(x)  =  1  -  g(x) 

where  1  denotes  the  M-dimensional  vector  with  Ts  as  components;  h[i]  =  1  —  g[il, 
i=  1,  2. 

(N.9)  Vmpm(x)  =  1  -  [hm(x)]l+r>m}  W  €  Af  ■ 

(N.10)  rj  -  {in,  rjt,  •  •  •  ,  tjm}  . 

(N.l  1)  vp(x)  =  [i?i/>i(z),  V2p2(x),  •  •  *  ,  r]pM(x)}  . 

(N.  12)  v<f>(x,  y )  =  f(x)  +  y'[np{x)\  (the  “modified  Lagrangian  expression”)  . 

2.3.1.  A  reformulation  of  Kuhn-Tucker  theorem  1.  This  slight  generalization  of  theo¬ 
rem  1  (see  [1],  p.  484)  is  needed  here  because  of  the  meaning  of  inequalities  given  in 
(N.l. 4).  [The  possibility  of  this  type  of  generalization  is  indicated  in  [1]  (see  pp.  491- 
492)  .]4 

We  shall  say  that  g  satisfies  the  Constraint  Qualification  (C.Q.)  at  x,  if  the  require¬ 
ments  of  the  definition  in  (A)  of  section  1.2  are  satisfied  with  the  inequalities  (4),  (5)  in 
the  same  section  interpreted  in  the  sense  of  (N.l. 4).  <j>(x,  y)  is  given  by  (1)  in  1.1.  (It  is 
immaterial  whether  g  or  'g  is  used.) 

Theorem.  If  f  and  g  are  differentiable,  x  £  O/0  and  g  satisfy  C.Q.  at  x,  then  there  exists 
a  y  £  Y  such  that 

y  ^  0;  <t>v-y  =  0;  <pv%y  =  0  for  all  y  ^  0  ; 
x  ^  0;  4>x’X  =  0;  4>x-x  ^  0  for  all  x"^  0  . 

[Note  that,  by  virtue  of  the  definitions  in  2.2.2,  this  means  that  <f)Vm  ^  0  if  m  eAf', 
4>Vm  =  0  if  m  $  Af',  <t>xn  ^  0  if  n  £  At',  =  0  if  n  $  At'-  The  other  inequalities  of 
the  theorem  are  also  to  be  interpreted  in  the  sense  of  (N.1.4).] 

*  See  also  Hurwicz  [9],  pp.  VIII  —  2-6. 
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2.3.2.  Theorem  1. 

Definition.6  An  M -dimensional  vector  =  [vi>  It, ,  Vm}  is  said  to  be  acceptable 
if,  for  each  m  6  ,  (1)  rjm  ^  0,  and  (2)  rjm  is  an  even  integer  if  hm(x)  <  —  1. 

Theorem  1.  If ,  for  some  p  >  0,  x  £  Sp(x),  x  £  O/0t  f  and  g  are  differentiable ,  and  g 
satisfies  C.Q.  at  x,  then ,  for  any  acceptable  ij,  there  exists  a  vector  y  —  y(r))  such  that 


(13) 

, 4>X’X  is  0  for  all  x  ^  0  ; 

(14) 

jpx'X  =  0  ; 

(15) 

*  ^  0  ; 

(16) 

nflivy  ^  0  for  all  y  ^  0  ; 

(17) 

jpy'y  —  0  ; 

(18) 

y  ^  0. 

The  bar  over  <j>  denotes  evaluation  at  x  =  x,  y  =  y(rj). 

Note  that  the  relations  (13)— (18)  are  necessary  conditions  for  a  nonnegative,  in  the 
sense  of  (N.1.4),  saddle-point  of  v<t>(x,  y)  at  (x,  y).  In  particular,  the  relations  (13)— (18) 
are  satisfied  if  one  selects  y  =  y(rj)  such  that® 

(19)  (1  +  Vm)ym(v)  =  ym(0)  ior  all  m  e  M • 

If  the  selection  is  made  in  accordance  with  (19),  the  equality 

(20)  o <f>*  —  ifpz 

will  hold.  Here  0<t>(x,  y)  is  ,</>(«,  y)  with  =  0;  this  is  obviously  the  same  as  <t>{x,  y)  in  (1) 
of  1.1. 

Proof.  For  rj  —  0,  the  preceding  theorem  follows  directly  from  the  reformulated  ver¬ 
sion  of  the  Kuhn-Tucker  theorem  1  given  in  2.3.1.  Thus  there  exists  a  vector 

(21)  m=  {yi(0),M0)r--,M0)} 

with  the  required  properties. 

Consider  now  the  case  ij  0.  We  shall  show  that  y(rj)  defined  by  (19),  that  is,  explicit- 
ly,  by 

(22)  y-W)  =tx—  y-(0), 

S~r  Vm 

[where  ym(0)  is  that  of  (21)],  satisfies  the  relations  (13)-(20). 

We  first  observe  that  (22)  yields 

(23)  (1  +  Tjm)ym(v)[hm(x)],lm  =  ym(0) ,  m  6  M  • 

[When  hm{x)  =  1,  (23)  follows  directly  from  (22).  When  hm(x)  1,  we  have  o<j>Vn  = 
gm(x)  >  0,  and  hence,  by  (16)— (18),  ym( 0)  =  0;  (22)  then  yields  ym(rj)  =  0  and  (23)  fol¬ 
lows.] 

Since 

(24)  (l  +  ’I-)y-(>l)[*-(*)rm%^-, 

8  In  many  applied  problems,  A«(*)  ^  0  for  all  m  and  all  *  ^  0.  It  was  pointed  out  by  Dr.  Masao  Fu¬ 
kuoka  that,  in  the  absence  of  such  an  assumption,  the  requirement  of  nonnegativity  of  the  components  of 
ij  is  insufficient  for  the  proof  of  the  theorem. 

*  Vm(0)  =  y  in  Kuhn-Tucker  theorem  1  (see  2.3.1). 


formula  (23)  implies 
(25) 
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M 


1$xn  fxn  "t“  y  m  (0) 

m=l 


dgm  (X) 
dxn 


»  €  >V- 


Noting  that  the  right  member  of  (25)  is  identical  with  we  conclude  that  the  re¬ 
lations  (13)— (15)  hold  for  all  ij  with  nonnegative  components,  since  they  are  known  to 
hold  for  r\  =  0. 

Relation  (16)  is  established  by  the  fact  that  the  right  member  of 

(26)  ,4>Vm  =  r,pm(x)  =  1  -  [hm(x)]l+*’* ,  m  e  Af, 

is  nonnegative  for  m  £  zero  for  m  £  M'  when  ij  is  acceptable  (see  the  definition 
above)  since,  for  any  m  6  M ■>  K%{x)  ^  1,  and,  furthermore,  v<f>Um  =  0  if  m  €  in 
which  case  hm(x)  —  1. 

Now  suppose  that,  for  some  mo  £  Mi  *i<£vmo  >  0,  that  is,  hmo(x)  <  1;  then,  by  (16)- 
(18)  for  rj  —  0,  ym„(0)  =  0;  hence  =  0,  and,  therefore, 

(27)  0lfm.*5'mo(l?)  =  0. 

Since  (27)  clearly  holds  in  the  alternative  case  ,0ma  =  0,  (17)  follows. 

Finally,  (18)  holds  because  ym(r j)  has  the  same  sign  as  ym(0)  and  the  latter,  by  (18) 
for  77  =  0,  is  nonnegative  if  m  £  M- 

2.3.3.  Theorem  2.  Let,  for  some  p  >  0,  x  £  Sp(x),  x  £  O/0,  such  that  (13)-(20)  are  satis¬ 
fied.  Then 

(28)  v<t>(x,  y)  ^  v4>(x,  y)  for  all  y  ^  0  . 

For  we  have 

(29)  v4>(x,  y)  -  „<*>(*,  y)  =  (y-  y)-vp  =  y*P  ^  o  for  y  ^  0 
where,  since 

(30)  y,(f>y  =  nP  , 

the  second  equality  follows  from  (17)  and  the  inequality  from  (16). 

2.3.4.  Notation. 

(N.13)  =  {*(»),*«>} 

where 

0 <£*(«>)  =  0 ,  o0*(»)  <  0 


and  either  component  may  be  empty. 

(N.  14)  x={xi,  xu\ 


where 

(N.14.1) 


X1  —  {^1))  #(21>J 


XU  —  x{22)  _ 


(Either  xr  or  x11  may  be  empty.) 

It  should  be  noted  that,  by  (13)— (15)  and  (N.13), 

0<t>xJ  =  0  , 


0  <t>X*l  <  0 


(N.14.2) 
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2.3.5.  Definition  of  a  regular  constrained  maximum.  In  theorem  3  below  we  use  the  con¬ 
cept  of  a  regular  constrained  maximum.  The  definition  of  such  a  maximum  is  given  in 
the  last  part  of  this  section.  To  state  it,  we  must  first  formulate  three  regularity  condi¬ 
tions  denoted  by  Ri,  R2,  R3. 

The  first  regularity  condition  Ri.  Let  x  be  a  value  maximizing  the  function  f(x)  subject 
to  'g(x)  ^  0,  x  ^  0,  and  hence  also  subject  to 

g(*)  ^  0 

(31) 

x  ^  0 

where  the  inequalities  are  to  be  interpreted  in  the  sense  of  (N.1.4). 

From  (N.6)  and  (N.7)  it  is  clear  that,  for  sufficiently  small  variations  of  x,  the  con¬ 
straints 

gI21C*0  ^  0 

(32) 

xW  ^  0 , 

which  are  a  part  of  (31),  can  be  disregarded.  Hence,  at  x,f(x)  possesses  a  local  maximum 
subject  to 

gI1](s)  ^  0, 

(33) 

x&  £  0 . 

Let  gt  be  a  subvector  of  g(l1  such  that  C„  =  C(0tiflMj  and  write 

(34)  g[11=  |gt,gtt}. 

The  components  of  gtt  can  be  disregarded  in  the  'process  of  maximization,  that  is, 
Of, a  =  If  the  Lagrangian  multiplier  vector  y[1)  (corresponding  to  the  con¬ 

straints  g[U  (x)  ^  0)  is  partitioned  according  to 

(35)  ym  =  {yt,  ytt} 
it  is  always  possible  to  put 

(36)  ytt  =  0 , 
and  this  will  be  done  in  what  follows. 

Assuming  that  the  constraints  (33)  are  consistent,  we  may  replace  them  by 

gf(*)  =  0 

(37) 

X®  ^  0 . 

The  first  regularity  condition  is 
(Ri)  rank  (g|(«))  =  dim  gt  =  Aft , 

say. 

Note  1.  Ri  corresponds  to  the  requirement  of  nondegeneracy  in  linear  programming 
(see  [4],  p.  340). 

Note  2.  Ri  implies  C.Q.  (see  appendix  I). 

The  second  regularity  condition  R2.  Since,  by  (N.7),  (N.6),  (N.14.1),  and  (34), 

gt(s)  =  0 

x11  =  0 , 


(38) 
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it  follows  that,  as  a  function  of  xr,  f(xT,  x11)  =  f(xT,  0)  has  at  x1  a  local  maximum  subject 
to  the  constraints 


(39) 


gi(xJ,  xIJ)  =  g*(xT,  0)  =0 


*(21>  ^  0 . 


The  corresponding  Lagrangian  expression  becomes 

(40)  o tfix1,  yt)  =  f(xT,  0)  +  y'-gKx1,  0) . 

Using  the  reformulation  of  Kuhn-Tucker  theorem  1,  given  in  2.3.1,  we  may  assert  the 
existence  of  a  yt  such  that 

(41)  *7^0;  o^  =  0; 

(42)  yt  =  0;  o^vt  =  0  . 

It  might  happen  that  some  components  of  yt  vanish.  Write  yt  =  {-y*)  y°}  where 
every  component  of  y*  is  different  from  zero  and 

(43)  y°  =  0. 

Let  gt  be  correspondingly  partitioned  as 

(44)  gt  =  {g*,  g°}  . 

Now  suppose  that  otfix1,  yt)  has  a  nonnegative  saddle-point  at  (x1,  yt).  By  theorem  3  in 
Kuhn-Tucker,  a  sufficient  condition  for  this  is  that /  and  g  be  both  concave.  One  can  then 
easily  verify  that 

(45)  •  o4>'(*',  y *)  =/(*',  0)  +  y*’g*(xI,  0) 

has  a  nonnegative  saddle-point  at  (x1,  y*). 

But  then  x1  maximizes  f(xT,  0)  subject  to  g*(xl,  0)^0  and  x(21)  ^  0.  Hence  in  this 
case  the  components  of  g°  could  have  been  disregarded  in  the  original  maximation  prob¬ 
lem  (Ofi0  =  O/,  (*.,*[.])). 

However,  complications  might  arise  if  otfix1,  yt)  did  not  have  a  nonnegative  saddle- 
point  at  (x1,  yt).  To  take  care  of  this  case,  one  might  require  that 

(46)  g°  is  empty  unless  o<£7(*7,  yt)  has  a  local  nonnegative  saddle-point  at  (x1,  yt) . 
However,  to  simplify  matters  we  shall  impose  the  seemingly7  stronger  condition 

(47)  g°  is  empty  . 

It  follows  that 

(48)  M*  ss  dim  g*  =  dim  gt  =  Mt . 

Let  A /*  [=  Af*  by  (47)]  denote  the  set  of  indices  of  g*.  Clearly,  for  tn  G  Af*  n 
(Af~Af  '),  we  may  have  ym  <  0. 

Now  suppose  the  preceding  reasoning  had  been  carried  out  in  terms  of  'g  instead  of  g. 
Nothing  would  be  changed,  except,  possibly,  the  signs  of  some  components  of  the  La¬ 
grangian  multiplier,  to  be  denoted  by  'y. 

That  is,  we  would  have  'ym  >  0  for  m  €  Af*  n  Af'  and  %  >  0  or  %  <  0  for 


7  See  section  2.3.7. 
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m  €  M*  n  Let  be  defined  by  the  relation  m  £  M~  if  and  only 

if  m  €  M*  n  (Af~M ')  and  'yOT  <  0.  Then,  it  is  clear  from  (N.2.2)  that  we  may  put 

jm  =  %  for  m  £  M~ 

(49) 

ym  =  — %  for  w  £  >{/' , 


so  that  ym  >  0  for  all  m  €>#/*• 

Hence,  without  loss  of  generality  [as  compared  with  (47)]  condition  (47)  may  be  re¬ 
stated  as  the  second  regularity  condition, 


(R2) 


g°  is  empty  and 

ym  >  0  for  all  M  £  M*  • 


The  first  regularity  condition  then  implies 


(50) 

rank  (g£o>)  =  M* 

where 

(51) 

M*  =  dim  g*  . 

The  third  regularity  condition  R3.  When  the  first  two  regularity  conditions  are  satis¬ 
fied,  second  derivatives  are  continuous,  and  x1  is  nonempty,  it  is  possible  to  show  (see 
appendix  II)  that  a  certain  quadratic  form  is  nonpositive  when  some  of  the  variables 
are  restricted  in  sign.  The  third  regularity  condition  is  a  strengthening  of  (71)  requiring 
that  the  quadratic  form  in  question  be  negative  under  the  same  restrictions.  This  con¬ 
dition,  analogous  to  that  used  by  Samuelson  (see  [5],  p.  358)  makes  it  possible  to  avoid 
going  beyond  second  order  terms  in  the  expansions  used. 

The  third  regularity  condition  is  formulated  in  terms  of  a  function  q{t)  of  a  new  vari¬ 
able  vector 

(52)  <={**,  f**} 

which  is  obtained  by  a  transformation  of  coordinates  from  x1  after  the  latter  has  been 
partitioned  so  that 

(53)  x1  =  {**,  ***}  , 
where  x*  is  a  subvector  of  x*15. 

We  shall  (a)  define  x*  and  x**;  (b)  write  down  the  transformation  defining  {/*,  /**} 
in  terms  of  {x*,  x**} ;  (c)  define  q(t) ;  and  (d)  formulate  the  third  regularity  condition. 

In  the  remainder  of  this  section  it  is  assumed  that  Ri  holds;  it  is  also  assumed  that  x1 
is  not  empty. 

First  case:  M*  =  0.  Write 


(54)  t=t**  =  x**  =  x1 , 

so  that,  by  (52)  and  (53),  x*  and  t*  are  empty,  and  define 

(55)  q(t)  =/(x',x")  =/(***,  0). 

The  third  regularity  condition  for  this  case  is  formulated  in  R3  below. 

Second  case:  M*  >  0.  (a)  The  definition  of  x*.  From  Ri  it  follows  that  there  exists  a 
(nonempty)  M *-dimensional  subvector  x*  of  x(1)  such  that 

(56)  gt*  is  an  M*  by  M*  (M*  ^  1)  nonsingular  matrix  . 
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We  then  define  x**  by  (53)  and  x^  by 

(57)  *(*>  =  {**,  x^}  . 

Clearly 

(58)  x**={xM,xW}. 

(b)  The  transformation  from  x1  to  t.  Let 

(59)  h*  =  1  -  g* 

where  1  is  the  M*-dimensional  vector  with  (scalar)  l’s  as  components,  t  —  {/*,  /**}  is 
then  defined  by  the  transformation 

(60)  l*  =  h*(x*,  x **,  x11) 

(61)  /**  =  x**  . 


We  also  partition  t**  by 

(62)  /**=  {/(»),  <(“)} 
where 


(63) 


/(12)  =  ^(12)  } 
/(21)  =  *(21)  # 


This  is  obviously  consistent  with  (57)  and  (61). 

(c)  The  definition  of  q(t).  By  (59),  the  Jacobian  H  of  the  transformation  (58)-(59)  is 

....  „  /**.  K.\  (C  C.\ 

so  that,  by  (56), 

(65)  \H\  =  -|-&|*0, 
that  is, 

(66)  H  is  nonsingular  . 

Hence,  locally,  (60)-(61)  can  be  solved  for  x1  in  terms  of  t;  we  may  write  this  solu- 


tion  as 

(67) 

x1  =  r(t) 

where 

(68) 

r  =  [r*  r**} 

and 

(69) 

x*  =  r*(t),  x**  =  r**\ 

The  function  q(t )  is  now  defined  as  f{x)  evaluated  at  x11  =  x11  and  with  x1  expressed 
in  terms  of  t,  that  is, 

(70)  q(t)  =  f[r(t),  x11]  =  f[r*(t*,  /**),  /**,  0] . 


The  statement  of  the  third  regularity  condition.  We  have  now  defined  q(t)  for  all  M*  pro¬ 
vided  the  first  regularity  condition  Rj  is  satisfied  and  x1  is  nonempty.  It  is  shown  in  ap- 
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pendix  II  that,  assuming  Ri,  R2,  and  the  continuity  of  the  second  derivatives,  unless  x** 
is  empty,  there  exists  p  >  0  such  that,  for  all  t**  6  Sp(x**), 

(7  1)  (/**  -  x**Y  qt**t**  (/**  -  ***)  ^  0 ,  if  ^  0. 

The  third  regularity  condition  is  a  strengthening  of  the  preceding  inequality.  It  states 
that 

(R3)  (a)  x**  is  empty  or 

(b)  there  exists  p  >  0  such  that,  for  all  t**  6  Sp(x**),  ( t **  —  x**)'  qt**t**(t**  — 
x**)  <  0  if  f<21>  ^  0  and  l**  ^  x**. 

Note.  The  situation  covered  by  (a)  of  R3  is  of  importance  since  it  permits  the  treat¬ 
ment  of  a  large  class  of  cases  where  /  and  g  are  linear. 

Definition.  f{x)  is  said  to  have  a  regular  maximum  at  x  subject  to  g(x)  ^  0,  x  ^  0,  if 
the  three  regularity  conditions  Ri,  R2,  R3  are  satisfied  at  x  and  x  6  0/„. 

2.3.6.  Theorem  3.  If,  for  some  p  >  0,  x  6  S„(x),  x  a  regular  maximum 8  of  f(x)  subject 
to  g(x)  ^  0  and  x  ^  0,  /  and  g  are  differentiable  {with  regard  to  x),  and  furthermore,  when 
x1  is  nonempty,  have  continuous  second  order  derivatives  with  regard  to  x1,  then,  for  all 
acceptable 9  77  sufficiently  large  in  each  component, 

(72)  x1  is  empty  , 
or 

(73)  (x1  —  xI)'r,<j)xixt(xT  —  xT)  <  0  if  £(21)  ^  0,  x1  x1 , 
and  for  some  p  >  0,  and  all  x  €  Sp'{x)  such  that  x  ^  0,  x  5^  x  , 

(7  4)  n<t>[x,  y(v) ]  <  5>(t?)] 

where  v<j>  and  y{rj)  are  defined  as  in  theorem  1. 

Note.10  Theorem  3  is  valid  for/,  g  linear  if  x**  is  empty  (regardless  of  whether  x*  is 
empty),  provided  the  first  two  regularity  conditions  hold.  However,  if  both  x*  and  x** 
are  empty,  xr  is  empty,  and  the  theorem  follows  from  the  first  case  considered  below.  If 
x**  is  empty  while  x*  is  nonempty,  use  the  first  two  cases  below  together  with  (90) 
(since  g*  is  nonempty  and  /**  is  empty).  Note  that  x**  is  empty  at  the  basic  solutions 
of  a  linear  programming  problem. 

2.3.7.  Proof  of  theorem  3.  First  it  is  shown  that  (72)  or  (73)  implies  (74).  Then  it  is 
shown  that  (72)  or  (73)  is  true. 

It  can  be  seen  that  if  theorem  3  is  established  for  the  case  of  {gtt;  g0}  empty,  then 
theorem  3  is  also  true  if  (i)  gtt  is  not  empty,  and/or  (ii)  g°  is  not  empty  but  otfix1,  yt) 
has  a  nonnegative  saddle-point  at  (x1,  yt),  since  in  either  case  x  remains  unchanged  and 
the  additional  terms  in  the  modified  Lagrangian  expression  vanish  at  y  [compare  equa¬ 
tions  (36)  and  (43)]. 

Hence,  with  no  loss  of  generality,  we  may  henceforth  assume  ]gtt?  g°j  to  be  empty, 
that  is, 

(75)  glll  =  g*. 

We  now  show  that  (72)  or  (73)  implies  (74),  that  is,  that  in  a  sufficiently  small  neigh¬ 
borhood,  if  (72)  or  (73)  is  assumed  to  be  valid  and  the  inequalities  x  ^  0,  x  ^  x,  hold, 

8  The  term  “regular  maximum”  is  defined  at  the  end  of  section  2.3.5. 

*  The  term  “acceptable”  is  defined  at  the  beginning  of  section  2.3.2. 

10  The  desirability  of  explicit  treatment  of  the  linear  case  was  emphasized  by  Dr.  Masao  Fukuoka. 
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the  conclusion  of  (74)  follows.  We  write  <f>  instead  of  throughout.  Also  (72)  or  (73), 
x  ^  0,  x  7*  x,  is  assumed. 

Let 

(76)  £  =  x  —  x 

(77)  F  =  s*  -  x' ,  *=/,  //. 

Firs/  case:  £/z  5^  0.  By  (20)  and  (N.14.2), 

(78)  <£*•£  =  <f>zf£T  4*  ^"•F7  <  0  . 

But  then  the  conclusion  of  (74)  follows  from  the  well-known  “Frechet”  property  of 
differentials11  which,  as  applied  to  the  present  case,  states  that,  given  any  a  >  0,  there 
exists  an  e  >  0  such  that, 

(79)  |j|j  [<£  (x,  y)  —  <f>  (x,  y)  —  <£*■ £]  <o 

if  |£|  <  e. 

Choose 

(80)  <r  =  —  -j-ij  <£*•  £ 

which  is  positive  by  (78).  Then,  for  a  sufficiently  small  [£  |,  we  have  by  (79) 

(81)  -j-|y  [</>(*>  y)  —  y)  +  <r]  <0- 
which  implies 

(82)  -j-£j  [<£(*>  y)  “<£(*>  y)]  <0 

and  hence  the  conclusion  of  (74). 

If  x 1  is  empty,  this  completes  the  proof  of  the  theorem  3,  since  x  ^  x  then  implies 
£/J  9^  0.  If  xl  is  not  empty,  we  must  consider  the 

Second  case:  £7/  =  0.  Since  it  is  assumed  that  x  ^  x,  £"  =  0  implies 

(83)  £7^0. 

In  virtue  of  the  existence  of  the  second  derivatives  of  <f>  with  regard  to  x1  (by  definition 
of  <f>,  and  the  assumptions  concerning  the  second  derivatives  of  /  and  g  with  regard  to  xT) 
we  have,  by  Taylor’s  theorem, 

(84)  <f>(x,  y)  -  <f>(x,  y)  =  0*/-F  +  KF)'&vF , 

where  <j>xixi  denotes  4>xrxi  evaluated  at  x  =  x,  x  =  x  +  0£,  0  <  6  <  1.  It  now  suffices 
to  note  that  (FX^cvF  is  negative  at  x  [since  (72)  or  (73)  is  assumed  to  hold  and  its  hy¬ 
potheses  are  satisfied]  and  continuous  in  the  neighborhood  of  x  (by  the  hypotheses  of  the 
theorem  concerning  the  second  derivatives  of  /  and  g ),  so  that,  for  a  sufficiently  small 
IF  I,  (F/4 WF  <  0.  Since  <£*j*F  =  0  by  (N.14.2),  (74)  follows. 

We  now  show  that  (72)  holds  if  x1  is  nonempty. 

First  case:  g*  empty.  By  equation  (75),  g[1]  is  also  empty.  Hence,  by  (13)— (15)  in  theo¬ 
rem  1, 

(85)  y(v)  =  yl21(0)  =  0 


11  See  Hille  [10],  p.  72,  definition  4.3.4,  equation  (iii). 
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and,  using  (N.12), 

(86)  „<£[*,  yiii)]  =  f(x) . 

Since  g*  is  empty,  we  have  M*  =  0,  and,  therefore,  the  definition  (55)  of  q  applies, 
so  that  (since  x*  is  empty  but  xx  is  not)  /**  is  not  empty  and 

(87)  <jt**t**  =  fx**X**  =  1} • 

Equations  (86)  and  (87),  together  with  the  third  regularity  condition  R3,  yield  (73)  for 
a  sufficiently  small  neighborhood  of  x. 

Second  case:  g*  nonempty.  Write 

(88)  ip(t,  y)  =  <t>[r(t),  xri,  y] 

where  r(t)  is  defined  in  (67).  (Where  it  is  desired  to  indicate  the  dependence  of  ^  on  ij, 
we  may  write  nyp  instead  of  \p.) 

Then,  by  (66),  that  is,  Ri,  we  have 

(89)  =  ht  |t-i  =  (H-iy^xIxiH-\  l  =  { h*(x),  x**}  , 

since  v<j>xi  =  =  0  by  (20)  and  (N.14.2). 

We  shall  now  show  that  (73)  is  implied  by 

(90)  T'tyttT  <  0,  if  t(29  0,  and  r  ^  0 

where  the  partitioning  of  r  corresponds  to  that  of  t.  We  show  later  that  (90)  holds. 

To  see  that  (90)  implies  (73),  let  x1  satisfy  the  inequalities  x<21)  ^  0,  xT  9*  x1.  Choose 


(91)  (T  )  =  t  =  H(x'-x')  =(kx*  h***\lX  X  \ 

\t**/  \0  I  /\x**-x**/ 

Since,  by  (66),  8  is  nonsingular,  x1  x1  implies  0.  Also,  (91)  yields 

(92)  r** 


**  _  z** 


hence,  in  particular, 

(93)  r<21>  =  *(21>  -  *(21> . 


But 

(94)  ^(29  =  0  , 

since  x(21)  is  a  component  of  x(2)  by  (N.13),  and  x(2)  =  0  by  (N.6).  Hence 

(95)  r<21>  =  x<21> 
and  thus  x(21)  ^  0  implies  r(21)  ^  0. 

Having  shown  that  the  hypotheses  of  (73)  imply  those  of  (90),  we  see  that  the 
hypotheses  of  (73),  together  with  the  validity  of  the  assertion  in  (90),  yield 

(96)  r'xpttT  <  0. 

But,  using  in  succession  (91),  (89),  and  simplifying,  we  have 

(97)  t'xPut  =  (x*  -  xTyH'ipttB(xI  -  x1) 

=  (x*  -  xtyB'iB-'y^xiB-'Bix*  -  *0 
=  (x1  —  X^'r^x^ix1  —  X1)  . 
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Formulas  (96)  and  (97)  yield  the  conclusion  of  (73).  Thus  it  has  been  established 
that  (90)  implies  (73).  It  remains  to  be  shown  that  (90)  is  valid.  It  is  convenient  to  write 
ipu  in  the  partitioned  form 

ip t*t*  ip t*t**  \  /A  B\ 

ipt**t*  ipt**t*v  '-B  O 

where  /**  may  be  empty;  t*  is  assumed  nonempty,  since  the  case  of  t*  empty  was 
treated  earlier. 

It  will  now  be  shown  that  A,  that  is,  ipt*t*  [compare  (98)],  which  depends  on  77,  can 
be  made  negative  definite  by  a  suitable  choice  of  77. 

Recalling  that  yf/*  denotes  the  set  of  indices  of  the  components  of  g*,  and  using 
(N.9)  and  (60),  we  see  that,  for  m  6  yf/*, 

(99)  vnpm(x)  =  1  -  t'+'m 

where  4.  is  a  component  of  t*. 

Since,  by  theorem  1  and  equation  (75), 

( 100)  ym(rj)  =  0  for  m  €  M*  , 

we  have,  from  the  definitions  of  \f/,  q,  and  ,<£  [equations  (88),  (70),  and  (N.12),  respective¬ 
ly],  and  the  preceding  relations  (99)  and  (100),  the  equality 


(98) 


(101)  *[<,  y  (»)!  =  «  W  +  2  IJ-WKl-*1-*")- 

me  M* 

Writing 

(102)  F  -  qt*t*  , 
we  have,  from  (101)  and  the  definition  of  A  that 

(103)  A  =  F  —  D  , 

where  D  =  1 1 dm,m'  |  |,  m  £  Af*>  m'  €  yf/*,  is  a  diagonal  matrix  [that  is,  dm,m’  —  0 
for  m  9*  m’\  with 

(104)  dm,m  ~  [ym(»?)](l  4-  ’nm)i lm  =  ym(0)rjm  ,  m  £  yf/*  , 

where  the  second  equality  follows  from  (19). 

Let  A  denote  the  largest  characteristic  root  of  F.  Since,  by  the  second  regularity  con¬ 
dition  Rs,  ym(0)  >  0  if  m  €  yf/*,  we  may  choose  77° ,  for  each  m  £  yf/*,  to  be  a  positive 
even  integer  satisfying 

(105)  v°m  >  A/%(0)  , 
so  that 

(106)  min  dm,m  >  X 

meM* 

for  all  acceptable  yjm  ^  Vm  • 

Then,  for  any  t*  9*  0,  and  each  acceptable  rjm  ^  77®,,  we  have 


(107) 


—  jLmmj  m  m ,  m  m 

me  M*  m 


=  t*'Dt*  , 
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that  is,  t*  7*0  implies  t*'(F  —  D)t*  <  0  for  all  sufficiently  large  acceptable  77,  or  A  is 
negative  definite  for  all  sufficiently  large  acceptable  rj. 

This  suffices  to  establish  (90)  and,  therefore,  (73)  if  t**  is  empty. 

Now  assume  t**  not  empty.  Write 

//  -  4-h3\ 

(108)  H  /  ) 

and 

(109)  =  P\$ttP  • 

Then  methods  used  to  show  that  (90)  implies  (73)  can  be  used  to  show  that 

(1 10)  <  0  for  w  5^  0,  w(21)  ^  0 


implies  (90).  This  is  because 

(111) 


A~lB 
I  . 


and,  like  its  analogue  8,  performs  an  identity  transformation  on  t**,  so  that  the  condi¬ 
tion  /(21)  ^  0  is  transformed  into  the  condition  wW  ^  0.  It  remains  to  establish  (110). 
Now  from  (109),  (108),  and  (98),  we  have 

/A0  \ 

(112)  „Q  =  ( 

\0  C  —  B'  A~lB/ 

so  that  wfaw  =  w*'Aw*  +  w**'  ( C  —  B'A~1B)w**. 

Now,  we  may  take  A  as  negative  definite,  and  hence,  to  establish  (110),  it  will  suffice 
to  show  that 

(113)  Q  -  w**'(C  -  B' A~1B)w**  <  0,  if  w**  0,  w;(21>  £  0  . 

Before  doing  so,  we  shall  obtain  an  auxiliary  result. 

It  will  now  be  shown  that  the  norm  of  A~l  can  be  made  arbitrarily  small  by  choosing 
77  sufficiently  large.  It  does  not  matter  which  of  the  many  norms  is  used  (see  Bowker  [6]). 
Note  that,  denoting  by  N(X )  the  norm  of  the  matrix  X,  we  have  N(A  +  B)  fS  iV(^4)  + 
N(B),  N (A B)  ^  N(A)N(B) ;  if  all  the  elements  of  a  matrix  approach  0,  so  does  its  norm. 
If  I  denotes  the  identity  matrix,  N(I)  =  1. 

D~l  is  a  diagonal  matrix  whose  nonzero  elements  approach  zero  for  rj  large;  hence,  the 
same  is  true  of  D~1F.  Therefore,  ij  can  be  chosen  sufficiently  large  so  that, 

(114)  I  —  D~lF  is  nonsingular  , 
and 

(115)  N(D~1F)  <  1  . 

Following  Waugh  (see  p.  148,  [7]),  we  use  the  identity,  valid  because  of  (114), 

(116)  (I  -  =  /+(/-  D~lF)~1D~1F , 

and  the  properties  of  the  norm  to  derive  the  relation, 

(117)  N[(I  -  D~1F)~1]  ^  1  +  N[(I  -  D-'Fy'MD-'F) . 

From  (117)  and  (115),  it  follows  that, 

(118)  N[(D  XF—  I)  M  ^  1  —  ^ ( D~lF)  ' 


SADDLE-POINT  PROBLEMS  1 7 

Since  A  =  F-  D  =  D(D~lF  -  /),  it  follows  that  A~l  =  (LTlF  -  and  hence 

(119)  N(A-')£  N(D-')NUD-'F-I)-']  g 

which  can  be  made  arbitrarily  small  for  rj  large. 

Consider  now  the  quadratic  form  Q  in  (113).  We  have  shown,  using  (101),  that 

(120)  C  =  —  $e**t**  • 

Hence  the  third  regularity  condition,  R3,  implies 

(121)  w**'Cw**  <  0  if  w**  ^  0,  wW  ^  0 . 

As  shown  earlier  N(B'A~lB)  ^  N{B')N(A~l)N(B)  =  N(A~l)[N{B)\*  can  be  made 
arbitrarily  small  by  choosing  a  large  enough  i).  Now 

(122)  \w**' B* A~lBw**  |  ^  N(Bf A~xB)w**'w**  , 

since  the  characteristic  roots  of  a  matrix  are  bounded  in  absolute  value  by  its  norm. 

Also,  denoting  by  n  the  maximum  of  w**'Cw**  subject  to  w**'w**  =  1,  wA21)  ^  0, 
we  have 

(123)  w**'Cw**  ^  nw**'w** 
and,  by  (121),  /t  <  0.  With  the  aid  of  (122), 

(124)  Q<[n+  N(B'A~lB )]w**'w**  if  w**  s*  0,  w/(21>  £  0  . 

By  choosing  jj  sufficiently  large,  so  that 

(125)  n  +  N(B'A-'B )  <0, 

we  establish  (113),  which,  in  turn,  yields  (110),  (90),  (73),  and  hence  theorem  3. 
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Let  the  first  regularity  condition  Ri  hold.  Consider  x  such  that, 

(126)  gw(s)  =  0,  g[21(*)  >  0,  x  ^  0 , 
and  x  such  that, 

(127)  li11  (x  -  *)  ^  0,  xW  -  x&  ^  0  , 


where  all  inequalities  are  to  be  interpreted  in  the  sense  of  (N.1.4).  Define  now  the 
function  g*  of  x  by 

(128)  gf(x )  =  {gt(*),  x**,  x11} 


where  x**  is  defined  by  (58).  Notice  that  assuming  g°  to  be  empty  as  in  (47),  g*,  like  x, 
has  N  dimensions. 

It  follows  that 


(129) 
and  hence 


£z** 

I 

0 


Sin 

0 

/ 


(130) 


ill  I  =  III*  1^0. 


“This  appendix  parallels  lemma  76.1  in  Bliss  (11]. 
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Consider  now  the  relation  which  associates  with  a  real  number  a  the  values  x  of  x  for 
which  the  equation 

(131)  g*(x)  =  g*(x)  +  ag*(x  -  x) 

is  satisfied.  In  virtue  of  the  implicit  function  theorem,  for  sufficiently  small  values  of 
a(131)  defines  x  as  a  (single-valued)  differentiable  function  of  a,  say 

(132)  *=*i(»), 
such  that 

(133)  H0)  =  x. 

Differentiating  (131)  with  respect  to  a  and  setting  a  =  0,  we  have 

(134)  gMO)  =  g*x{x  -  x) 
and  hence,  because  of  (130), 

(135)  t[(0)  =  x  -  x  . 

We  shall  now  show  that 

(136)  ^ i(a)  £  C0  for  a  ^  0,  a  sufficiently  small . 

By  (131),  (126),  and  (127) 

(137)  gt(*)  =  oiglix  —  x)  ^  0  for  a  ^  0 . 

It  follows  that 

(138)  gW(*)  ^  0  for  a  ^  0, 
which  together  with 

(139)  gm[^i(a)]  ^  0  for  a  sufficiently  small , 
yields 

(140)  g[j/ i(a)]  ^  0  for  a  ^  0  sufficiently  small . 

Now,  since  x*  is  a  subvector  of  *0),  is  a  subvector  of  {a:**,  x11 } ,  hence  (127)  and 
(131)  imply 

(141)  xf®  =  ^i2)(a)  =  x,W  -f-  a(*(2)  —  xW)  ^  0  for  a  ^  0 
which,  together  with 

(142)  |0)  =  ^i1}(a)  ^  0  for  a  sufficiently  small,  yields 

(143)  ^i(a)  ^  0  for  a  ^  0,  a  sufficiently  small . 

In  turn,  (140)  and  (143)  yield  (136). 

Now  let  us  interpret  “a  sufficiently  small”  as  0  ^  a  ^  X  where  X  >  0  and  define 
the  function  \p  by 

(144)  rj/(d)  =  rfr i(X0)  for  all  0  ^  0  ^  1  . 

Then 

*(0)  =  x , 

(145)  *'(0)  =  X^(0)  =  \(x-x),  X  >  0 , 

t(6)  €  C0 ,  0  ^  6  <>  1  . 

Since  (145)  are  precisely  the  requirements  of  C.Q.,  it  has  been  shown  that  Ri  implies  C.Q. 
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APPENDIX  II 

We  shall  now  show  that,  if  the  first  two  regularity  conditions  hold  and  if  in  a  neigh¬ 
borhood  of  x,f  and  g  are  assumed  to  possess  continuous  derivatives  of  second  order  with 
regard  to  x1,  then  (71)  is  valid. 

Let  x**  be  nonempty.  Then,  writing 

(146)  l*  =  h*(x )  =  1  (a  vector  of  l’s) , 

(147)  t**  =  x**, 
we  have,  using  Taylor’s  theorem, 

(148)  qit*,  t**)  -  q(i*y  l**)  =  qt**-(t**  -  l **)  +  \(t**  “  l**Yqt**t*<t**  -  I**)  , 

where  over  a  symbol  denotes  the  evaluation  at  t  =  l,  while  “  -  ”  over  a  symbol  denotes 
evaluation  at  t  —  t,  where  1  =  1+6  (t**  —  l**),  0  <  6  <  1.  Now  suppose  it  has  been 
shown  that  (a)  q(l *,  /**)  has,  as  a  function  of  t**,  subject  to  the  constraint  ^  0,  a 
local  maximum  at  /**  =  l**,  and  (b)  qt**  =  0.  From  (a)  it  follows  that,  in  a  sufficiently 
small  neighborhood,  the  left  member  of  (148)  is  nonpositive  if  ^  0.  But  then,  using 
(b),  we  see  that  the  quadratic  form  in  the  right  member  of  (148)  is  nonpositive.  Since,  by 
hypothesis,  qt**t**  is  a  continuous  function  of  /**,  we  have,  for  /(21>  ^  0,  and  in  a  suffi¬ 
ciently  small  neighborhood  of  l, 

(149)  (/**  -  t**)qt**t**(t**  -  {**)  ^  0 

which  is  the  desired  result  (71).  Hence  it  remains  to  prove  (a)  and  (b). 

(a)  q(i*,  l**)  has,  as  a  function  of  t**,  subject  to  /(21)  ^  0,  a  local  maximum  at  t**  =  l**. 
It  follows  from  the  remarks  at  the  beginning  of  the  discussion  of  the  second  regularity 
condition  that  fix1, 0),  as  a  function  of  x1,  has  a  local  maximum  at  x1  =  x1,  subject  to  the 
constraints 

(150)  g\{x*,  0)  =  0,  *(21>  ^  0  . 

Hence,  subject  to  the  same  constraints,  q{t)  has  a  local  maximum  at  l.  Now  we  must 
distinguish  the  two  ways  in  which  the  “milder”  (46)  second  regularity  condition  R2  may 
be  satisfied. 

First  way:  otfix1,  yt)  has  a  nonnegative  saddle-point  at  (x1,  yf),  that  is,  locally,  since 
y°  =  0  by  (43), 

(151)  fix1,  0)  +  y**g*(z7,  0)  ^  fix1,  0)  +  y*-g*ixT,  0) 
for  all  x1  such  that  xW  ^  0. 

But  g*ixT,  0)  =  0  because  of  (150),  and  g*ix*,  0)  in  the  left  member  of  (151)  vanishes 
for  t*  =  i*.  Hence  (151)  yields,  locally  and  for  ^  0, 

(152)  f[r*il*,  t**),  t**,  0]  ^  f[r*il*,  l**),  l **,  0] 

which  means  precisely  that  qit*,  t**)  has  a  local  maximum  at  l**  subject  only  to  /(21)  ^  0. 
Second  way:  g°  is  empty.  In  this  case  (150)  is  equivalent  to 

(153)  g*ix',  0)  =  0, 

(154)  *(21>^0. 

But  (153)  is  necessarily  satisfied  if  t*  =  l*  and  hence  can  be  disregarded.  Since  qit) 
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was  seen  to  have  a  local  maximum  at  l  subject  to  (150),  it  follows  that  q(l*,  t**)  will  have 
a  local  maximum  at  l**  subject  only  to  ^  0. 

(b)  qt*+  **  0  . 

We  have 

(155)  qt**  —  fz*f**+  4"  fz**  • 

We  now  evaluate  the  three  expressions  on  the  right-hand  side  of  (155).  We  start  with 
f ***.  Noting  that 

(156)  t**),  /**],  0}  =  0  for  all  t**  , 

we  obtain  by  differentiation  with  respect  to  /**,  using  (60)  and  (69),  and  evaluating 
at  t  =  l, 

(157)  |M~  +  g!~  =  0; 
in  virtue  of  Ri  this  can  be  solved  yielding 

(158)  r*~= 

To  find  fx*,  Jz**,  we  write  the  condition  that  o 0  V  =  0,  using  equation  (41)  in  the  form 

/**  4*  gt*y*  -  0 , 

(159) 

fx**  4-  gt**y*  =  0 . 

The  terms  involving  g°  vanish,  of  course. 

Substituting  (159)  and  (158)  into  (155),  we  have 

(160)  qt~  =  (-?*•£-)  4-  (-y**gt*)[- (g**)-1^**]  =  (-y*-£~)  4-  (y*-fx**)  =  0 . 
This  completes  the  proof  of  (71). 
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